Predicting phenotype from patterns of annotation

نویسندگان

  • Oliver D. King
  • Jeffrey C. Lee
  • Aimée M. Dudley
  • Daniel M. Janse
  • George M. Church
  • Frederick P. Roth
چکیده

MOTIVATION Predicting the outcome of specific experiments (such as the growth of a particular mutant strain in a particular medium) has the potential to allow researchers to devote resources to experiments with higher expected numbers of 'hits'. RESULTS We use decision trees to predict phenotypes associated with Saccharomyces cerevisiae genes on the basis of Gene Ontology (GO) functional annotations from the Saccharomyces Genome Database (SGD) and other phenotypic annotations from the Yeast Phenotype Catalog at the Munich Information Center for Protein Sequences (MIPS). We assess the methodology in three ways: (1) we use cross-validation on the phenotypic annotations listed in MIPS, and show ROC curves indicating the tradeoff between true-positive rate and false-positive rate; (2) we do a literature-search for 100 of the predicted gene-phenotype associations that are not listed in MIPS, and find evidence for 43 of them; (3) we use deletion strains to experimentally assess 61 predicted gene-phenotype associations not listed in MIPS; significantly more of these deletion strains show abnormal growth than would be expected by chance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PECTIC ENZYME PATTERNS OF FUSARIUM OXYSPORUM VIRULENT ISOLATES FROM CHICKPEA IN IRAN

The pectic enzymes produced in vitro by 8 isolates (5 Highly virulent and 3 Weakly virulent) of Fusarium oxysporum , were detected by spectrophotometry, and characterized by polyacrylamide gel electrophoresis with substrate-containing gels (zymogram). Analysis of the polygalacturonase (PG) isozyme banding patterns (zymogram) identified two distinct phenotypes among the isolates from chickpea (C...

متن کامل

An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies

A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...

متن کامل

Primary root growth, tissue expression and co-expression analysis of a receptor kinase mutant in Arabidopsis

There is no functional annotation for the majority of the several hundreds of receptor-like kinases in plants. A direct way of inferring the function of these proteins is to study the phenotype that results from loss of function mutants such as T-DNA mutant lines. In this research a function (phenotype) to At2g37050 gene that encodes a receptor like kinase in Arabidopsis T-DNA line was...

متن کامل

Detection of magA Gene in Klebsiella spp. Isolated from Clinical Samples

  Objective(s): Klebsiella infections are caused mainly by K. pneumoniae and K. oxytoca. In the last two decades, a new type of invasive Klebsiella pneumoniae which contains mucoviscosity-associated gene (magA) has emerged. The aim of this study was to investigate the prevalence of magA gene and to detect antimicrobial susceptibility patterns of Klebsiella   spp. isolated from clinical samples....

متن کامل

Predicting the Family Function based on Early Maladaptive Schemas and Couples Communication Patterns (Case Study: Education)

Purpose: The aim of this research was predicting the family function based on early maladaptive schemas and couple’s communication patterns. Methodology: Present study was descriptive from type of correlation. The research population was married female employees working in public and non-public schools of Tehran city and their spouses in 2017-2018 academic years. The research sample was 482 peo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 19 Suppl 1  شماره 

صفحات  -

تاریخ انتشار 2003